Discovery of Genotype-to-Phenotype Associations: A Grid-enabled Scientific Workflow Setting
نویسندگان
چکیده
The heterogeneity and scale of the data generated by high throughput genotyping association studies calls for seamless access to respective distributed data sources. Toward this end the utilization of state of the art data resource management and integration methodologies such as Grid and Web Services is of paramount importance for the realization of efficient and secure knowledge discovery scenarios. In this paper we present a Grid-enabled Genotype to Phenotype scenario (GG2P) realized by a respective scientific workflow. GG2P supports seamless integration of clinico-genetic heterogeneous data sources, and the discovery of indicative and predictive clinico-genetic models. GG2P integrates distributed (publicly available) genotyping databases (ArrayExpress) and utilizes specific data-mining techniques for feature selection – all wrapped around custommade Web Services. GG2P was applied on a whole-genome SNP-genotyping experiment (breast cancer vs. normal/control phenotypes). A set of about 100 discriminant SNPs were induced, and classification performance was very high. The biological relevance of the findings is strongly supported by the relevant literature.
منابع مشابه
Architectural Plan for Constructing Fault Tolerable Workflow Engines Based on Grid Service
In this paper the design and implementation of fault tolerable architecture for scientific workflow engines is presented. The engines are assumed to be implemented as composite web services. Current architectures for workflow engines do not make any considerations for substituting faulty web services with correct ones at run time. The difficulty is to rollback the execution state of the workflo...
متن کاملArchitectural Plan for Constructing Fault Tolerable Workflow Engines Based on Grid Service
In this paper the design and implementation of fault tolerable architecture for scientific workflow engines is presented. The engines are assumed to be implemented as composite web services. Current architectures for workflow engines do not make any considerations for substituting faulty web services with correct ones at run time. The difficulty is to rollback the execution state of the workflo...
متن کاملGrid-Flow: a Grid-enabled scientific workflow system with a Petri-net-based interface
Advances in computer technologies have enabled scientists to explore research issues in their respective domains at scales greater and finer than ever before. The availability of efficient data collection and analysis tools presents researchers with vast opportunities to process heterogeneous data within a distributed environment. To support the opportunities enabled by massive computation, a s...
متن کاملFacilitating e-Science Discovery Using Scientific Workflows on the Grid
Xiaoyu Yang et al. (eds.), Guide to e-Science: Next Generation Scientific Research and Discovery, Computer Communications and Networks, DOI 10.1007/978-0-85729-439-5_13, © Springer-Verlag London Limited 2011 Abstract e-Science has been greatly enhanced from the developing capability and usability of cyberinfrastructure. This chapter explains how scientific workflow systems can facilitate e-Scie...
متن کاملApplication of Standard Semantic Web Services and Workflow Technologies in the SIMDAT Pharma Grid
The SIMDAT Pharma Grid is an industry-oriented, semantics enabled Grid environment whose purpose, among others, is to intelligently assist Biologists in conducting in-silico experiments through automating discovery, selection, composition, and invocation process of bioinformatics data services and analysis services. In the first system architecture design and prototype implementation, we apply ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009